
Support partial downloads #1020

Merged · 8 commits · Apr 8, 2017

Conversation

jelford
Contributor

@jelford jelford commented Mar 31, 2017

This PR should close #889

A couple of implementation details:

Only added support for the curl backend; we previously discussed that there's an intention to get rid of rustup's own download code, and the default feature set uses curl anyway, so hopefully this is okay.

Added new testing to the download crate - while it's there, it makes sense to have a test. Since we're using curl's "resume" functionality, I figured it's probably fine to just use file:// URLs for test cases. I previously tested using a small hyper-based HTTP server, but that feels like overkill.

For hashing files, I've set the buffer size to 2^15 bytes - just because that's what strace tells me sha256sum uses on my local PC. It seems much slower than that command, though, and it's not obvious why, so maybe I've done something silly here.

Finally, and maybe most controversially, I haven't done anything about cleaning up aborted partials. I don't really know when a good time is to do this, but a couple of suggestions that I'd be happy to implement:

  • Every run, just check the download cache for any files > 7 days old and smoke them
  • On self-update, as that seems like a natural time for generic "maintenance" sorts of operations

Same disclaimer as in my last PR: I haven't written much Rust, so I fully expect you will see some problems (also very happy to accept style criticisms). I accidentally ran rustfmt on some things, so apologies for the noise (I can revert, but... maybe it's worth having anyway?).

* Adds support only for the Curl backend, which is the default anyway.

* In order to distinguish between a file that has been fully downloaded
(but not used yet) and should therefore be hash-checked vs. those that
require more data, store partials with a .partial extension.

* Adds a simple http-server to rustup-mock, to allow the download module
to be properly tested. It's not clear how to easily emulate a server
that stops half-way without that. The tests for the overall
download-resumption functionality should be fairly re-usable if we migrate
to another download solution in the future (e.g. in rust-lang#993)

* Don't bother with resumption for metadata files, since they're likely
to go out of date anyway. This allows the removal of the HTTP test server,
and simplifies test cases, as we can just read/write files on disk, like
the existing dist tests.
…hes, and make consistent with partial download code.
@jelford
Contributor Author

jelford commented Mar 31, 2017

Not sure about that travis error: "couldn't find crate for std." I don't know whether there's a way to just give it another kick?

@Diggsey
Contributor

Diggsey commented Mar 31, 2017

I restarted that job. The appveyor issues are legit though.

@jelford
Contributor Author

jelford commented Apr 1, 2017

Yes, that could definitely never work. I don't know why I put the conditional there. I can't check my work, though, as I don't have a Windows box, but I think I've just pushed a fix.

@Diggsey
Contributor

Diggsey commented Apr 1, 2017

Thanks @jelford
Could you revert the formatting changes from this PR? The Rust style guidelines are currently in the process of being changed (away from visual indentation, by the look of things), so I don't think we should reformat anything at this time - and even if we did, it should be separate from functional changes to the code.

While it would be nice to move the download code out of rustup, I'm not sure that the intent is to switch to curl. I'm not necessarily opposed to adding this kind of feature just for the curl backend if it will help people on poor connections - @brson, thoughts?

@jelford
Contributor Author

jelford commented Apr 1, 2017

Thanks for the quick response @Diggsey.
I've reverted those formatting changes, sorry for the noise.

On switching to curl, I just meant to observe that it's the default, so most people should get the benefit right away. I'm hoping 90% of this goes away as part of #993, so I wasn't keen to spend too long getting it into the hyper backend.

As a side note, from a quick look I don't think reqwest currently supports resume as a first-class thing. I was thinking it's easy enough to just set the appropriate header, but we need to think about the error case where the server doesn't support it: libcurl will just bomb out when it gets a response without the Content-Range header. Is that actually fine? Do all the mirrors support it? It works for me locally, but I don't know whether it's S3 everywhere or something else in different parts of the world. If not, it would be a shame to break the whole download for what is essentially a bandwidth optimization.

Options in that case would be:

  • add an environment variable to de-activate resume requests
  • fail gracefully from the specific curl error and fall back to downloading the whole file.

The former is simple, but not as nice a UI.

@brson
Contributor

brson commented Apr 5, 2017

While it would be nice to move the download code out of rustup, I'm not sure that the intent is to switch to curl though? I'm not necessarily opposed to adding this kind of feature just for the curl backend though if it will help people on poor connections - @brson thoughts?

My intent is to switch to reqwest, assuming that reqwest uses native-tls, and someday gets support for rustls. The rustup download crate was supposed to basically be reqwest, but at this point it seems best to do the hyper/rustls transition in the ecosystem, not in rustup. Adding resume support just means we'll have to make reqwest support that too (or otherwise create a crate that does what reqwest does with resume support).

@brson
Contributor

brson commented Apr 5, 2017

Good tests, thanks @jelford.

@brson
Contributor

brson commented Apr 5, 2017

Is that actually fine? Do all the mirrors support it? It works for me locally, but I don't know whether it's S3 everywhere or something else in different parts of the world. If not, it would be a shame to break the whole thing for what is essentially a bandwidth optimization.

I think we cannot assume much about the servers, because people will at some point host their own rustup mirrors.

@brson
Contributor

brson commented Apr 5, 2017

Patch looks good to me, r=me if @Diggsey is happy.

@Diggsey
Contributor

Diggsey commented Apr 5, 2017

@bors r=brson

@bors
Contributor

bors commented Apr 5, 2017

📌 Commit c63b275 has been approved by brson

@bors
Contributor

bors commented Apr 5, 2017

⌛ Testing commit c63b275 with merge 8c481c8...

bors added a commit that referenced this pull request Apr 5, 2017
Support partial downloads

@bors
Contributor

bors commented Apr 5, 2017

💔 Test failed - status-appveyor

@brson
Contributor

brson commented Apr 8, 2017

Windows flakiness

@brson brson merged commit 4cdbe1b into rust-lang:master Apr 8, 2017
Successfully merging this pull request may close these issues.

Support resume of partial downloads